Double-stranded RNAs are an important class of functional
macromolecules in living systems. They are usually found as
part of highly specialized intracellular machines that control
diverse cellular events, ranging from virus replication, antiviral
defense and RNA interference to the regulation of gene activity and
genomic integrity. Within these machines, the RNA duplex is often
found in association with specific RNA-dependent ATPases, including
the Dicer, RIG-I and DRH-3 proteins. These duplex RNA-activated
ATPases represent an emerging group of motor proteins within the
large and diverse superfamily 2 (SF2) of nucleic acid-dependent
ATPases, which were historically defined as SF2 helicases. The
duplex RNA-activated ATPases share characteristic molecular
features for duplex RNA recognition, including specialized motifs
(e.g., motifs IIa and Vc) and an insertion domain (HEL2i), and they
require double-stranded RNA binding for their enzymatic activity.
Proteins in this family undergo large conformational changes
concomitant with RNA binding, ATP binding and ATP hydrolysis in
order to achieve their functions, which include the release of
signaling domains and the recruitment of partner proteins. The
duplex RNA-activated ATPases represent a distinct and fascinating
group of nanomechanical molecular motors that are essential for
duplex RNA sensing and processing in diverse cellular pathways.



Background 

Many cellular processes result in the production of double-stranded
RNA molecules, including transcription of convergent cellular genes
or mobile genetic elements, self-annealing of cellular transcripts
and the replication of common RNA viruses. Duplex RNAs are important
for numerous cellular functions, including gene regulation,
chromatin remodeling, antiviral defense and maintenance of genomic
integrity. 1,2 Most of these processes involve the interaction of
double-stranded RNAs with conserved and highly specialized
intracellular machines. Well-characterized examples include Dicer
and the RIG-I-like receptors, as well as Dicer-related helicases 1
and 3 (DRH-1 and DRH-3), two mechanical proteins involved in the
RNA interference (RNAi) pathway in worms. 1,3 Although there are
fundamental differences between these proteins, they share a
similar, highly conserved motor domain that is essential for duplex
RNA sensing, signaling and processing. This domain is similar in
sequence and form, if not function, to the helicase domain that is
found in many DNA and RNA remodeling proteins. 4,5

Helicases have been classically defined as enzymes that couple ATP
hydrolysis to the unwinding of nucleic acid duplexes, and they were
originally grouped into phylogenetic families based on sequence
conservation rather than function. 6 However, members of these
families were subsequently shown to have diverse mechanical
functions, of which duplex unwinding is only one. Therefore, these
enzymes are now commonly referred to as nucleic acid remodeling
proteins or, perhaps more correctly, as nucleic acid-dependent
ATPases. 5,7 Other classifications have grouped these proteins by
their nucleic acid target (RNA or DNA), the strandedness of the
nucleic acid (α for single-stranded NA or β for double-stranded NA)
and the translocation polarity on the nucleic acid (A for 3' to 5'
or B for 5' to 3'), as defined by Wigley et al. 8 Sequence and
structure analyses have revealed a common arrangement of conserved
motifs for the Superfamily 1 and 2 (SF1 and SF2) nucleic
acid-dependent ATPases. 5 In these proteins, two conserved RecA-like
domains lie against each other, forming a cleft that binds and
hydrolyzes ATP, thereby serving as the catalytic core. This ATPase
core includes conserved motifs Q, I, II and VI (Fig. 1A), which are
aligned and rigidified through binding of RNA along the surface of
the RecA folds. Conserved motifs Ia, Ib, Ic, IV, IVa, V and Vb
mediate RNA binding, while motifs III and Va help to couple nucleic
acid binding with ATP hydrolysis. Despite the high degree of
conservation in both RNA binding and ATPase motifs, SF1 and SF2
proteins have unique functions and are usually not interchangeable.
Specialization in mechanical function and the presence of accessory
domains make each nucleic acid-dependent ATPase unique. 4 These
enzymes are involved in every aspect of nucleic acid metabolism in
all living organisms and viruses. 5,9-11 Because of their conserved
molecular functions, they are also heavily involved in genetic,
autoimmune and infectious diseases, and they are potential targets
for drug discovery.

Recently, significant progress has been made in our understanding
of RNA-dependent ATPases, including the identification and
characterization of new examples like DRH-1 and DRH-3 12,13 from
nematodes and new structural and functional insights into known
cases like Dicer 14 and RIG-I. 15 In this review, we focus on this
emerging group of specialized RNA-dependent ATPases that includes
Dicer, a ribonuclease that plays an essential role in miRNA and
siRNA biogenesis; 14 RIG-I, an intracellular surveillance protein
for detecting pathogenic RNA; 15 and DRH-3 (Dicer-related
helicase 3), a component of the siRNA pathway from Caenorhabditis
elegans. 12,13 The mechanistic feature shared by all these proteins
[hereafter named Duplex RNA-activated ATPases (DRAs)] is that dsRNA
is required to stimulate their ATPase activity and thereby activate
all subsequent functions, which is in sharp contrast to other SF2
proteins that are specifically activated by single-stranded RNA.
Further, unlike the bona fide RNA helicases, DRAs are unlikely to
display RNA unwinding activity. 16,17 Rather, the conformational
changes that occur upon binding to RNA and ATP are coupled to other
processes, such as the release of signaling domains and binding to
partner proteins. Here we review the discovery of the DRAs,
highlight recent advances in understanding of their function and
discuss how this is related to their structural features.

Figure 1. DRAs and related nucleic acid-dependent ATPases. (A) The
conserved SF2 ATPase/helicase core. The figure is prepared from
eIF4AIII, a component of the exon junction complex (PDB: 2J0S).
Positions of the characteristic motifs are highlighted. (B) Domain
organization of DRAs. Domains are not drawn to scale. C-terminal
regions that resemble the ATPase domains of Dicer and FANCM-like
proteins are simplified and are not labeled. (C) Schematic
cladogram showing DRAs within the SF2 family of proteins,
specifically the DEAD-box family and double-stranded nucleic
acid-binding ATPases. The alignment and family trees were
determined with the UGENE software package. 118 The multiple
sequence alignment was run with T-Coffee 119 on the core
ATPase/helicase domains listed in Table 1, and the family tree was
determined using the PHYLIP neighbor-joining method with the
Jones-Taylor-Thornton distance matrix. Pairwise sequence identities
for the ATPase core regions of DRAs range from the highest, 42%
(hsMDA5:hsLGP2) and 36% (hsRIG-I:hsMDA5), to 26% (hsRIG-I:ceDRH-1)
and 21% (ceDRH-1:ceDRH-3), with the lowest, 14%, between hsDicer1
and ceDRH-3.

Comparison of DRAs with
Related Mechanical Proteins



DRA proteins are phylogenetically classified as a subgroup within
Helicase Superfamily 2 (known as SF2 proteins, Fig. 1C), 4 and
present a core ATPase domain that is very similar in both sequence
and structure to that of the DEAD-box family ATPases/helicases.
Unlike DEAD-box proteins, DRAs contain a unique α-helical insertion
domain (HEL2i) within the second RecA fold of the core
ATPase/helicase domain. Structural studies have shown that this
adaptation is important for duplex RNA binding (Fig. 1B and
C). 17,19 As the closest phylogenetic relatives of RIG-I, the innate
immune sensors MDA5 and LGP2 are more similar to each other than to
RIG-I, although there are conflicting reports on which of these
proteins should be considered the evolutionary antecedent of the
others. 20,21 A constructed family tree of SF2 proteins from several
subgroups suggests that DRH-1 is more closely related to RIG-I than
to Dicer (Fig. 1C), underscoring the difficulties in naming these
proteins based on functional associations.
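
The pairwise identities quoted in the Figure 1 legend (for example, 42%
for hsMDA5:hsLGP2) are the kind of number that can be computed directly
from two rows of a multiple sequence alignment. The Python sketch below
shows one common convention, in which identity is counted only over
columns where neither sequence has a gap. The function name
`percent_identity` and the toy sequences are illustrative assumptions;
this is not the UGENE or T-Coffee procedure used for the figure, and the
fragments are not real DRA sequences.

```python
def percent_identity(aligned_a: str, aligned_b: str, gap: str = "-") -> float:
    """Percent identity between two rows of an alignment.

    Assumption: identity is counted over columns where neither row has a
    gap. Other conventions (e.g., dividing by the shorter sequence
    length) give different numbers.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    compared = 0
    matches = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == gap or b == gap:
            continue  # skip columns with a gap in either row
        compared += 1
        if a == b:
            matches += 1
    if compared == 0:
        return 0.0
    return 100.0 * matches / compared


# Toy aligned fragments (hypothetical, not real DRA sequences).
row_1 = "MKT-LLGAKS"
row_2 = "MRTALL-AKT"
print(round(percent_identity(row_1, row_2), 1))  # prints 75.0
```

Because the denominator depends on how gaps are treated, identities
computed this way are only comparable when the same convention and the
same alignment are used for every pair.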

Perhaps most significant given their function, DRAs are most closely
related to proteins that act not on RNA, but on double-stranded DNA
(Fig. 1C). The DRAs are relatives of the FANCM family of proteins
that function during DNA repair, which includes Hef, FANCM, Mph1
and Fml1 (www.rnahelicase.org/rig.htm database) (Fig. 1 and
Table 1). Members of the FANCM family bind dsDNA and contain a
similar α-helical HEL2i insertion as the DRAs. 22 Other closely
related dsDNA binding proteins include members of the Swi/Snf, RecG
and T1R families (Fig. 1C and Table 1). 4 Swi/Snf proteins have a
six α-helix motif that is inserted within two parts of the HEL2
domain. This motif interacts with the 5' strand of dsDNA and
occupies the same position relative to HEL1 and HEL2 as the HEL2i
domains of Hef and RIG-I. 23-25 Interestingly, RecG proteins have a
TRG motif (translocation in RecG) located immediately after motif
VI that forms a helical hairpin proposed to be a transmission
system for driving double-stranded translocation. 26 This hairpin
appears to be analogous to the pincer domain of RIG-I and other
DRAs, although the pincer domain is significantly longer and more
complex. 27 Similarly, members of the prokaryotic T1R family have an
α-helical domain at their C terminus that plays a role in
recognizing foreign dsDNA, either from transmissible plasmids or
from phages. 28

Table 1. SF2 nucleic acid-dependent ATPases listed in this study

| Family | NA | SF2 protein | Organism a | Function | Sequence ID b | PDB Code c |
| --- | --- | --- | --- | --- | --- | --- |
| DEAD | RNA | eIF4AIII | hs | Exon junction complex | P38919 | 2J0Q, 2J0S, 2J0U, 2HYI, 2HXY |
| DEAD | RNA | VASA | dm | Germ cell development | P09052 | 2DB3 |
| DEAD | RNA | Mss116 | sc | Group II intron splicing | P15424 | 3I5X, 3I5Y, 3I61, 3I62 |
| DEAD | RNA | Ddx19 | hs | Nuclear mRNA export | Q9NUU7 | 3EWS, 3G0H |
| Swi/Snf | DNA | Rad54 | hs | DNA repair | Q92698 | 1Z3I, 1Z63, 1Z6A |
| | DNA | RapA | | Transcription regulation | P60240 | |
| Swi/Snf | DNA | Chd1 | hs | Chromatin remodeling | O14646 | 3MWY, 3TED, 2XB0 |
| RecG | DNA | RecG | ec | DNA repair | P24230 | 1GM5 |
| RecG | DNA | PriA | ec | DNA replication | P17888 | None |
| RecG | DNA | MFD | ec | DNA repair | P30958 | None |
| T1R | DNA | EcorAI | ec | DNA restriction | Q07736 | 2W00 (EcoR124I) |
| T1R | DNA | EcorEI | ec | DNA restriction | Q47281 | 2W00 (EcoR124I) |
| T1R | DNA | EcorKI | ec | DNA restriction | P08956 | 2W00 (EcoR124I) |
| DRA | RNA | RIG-I | hs | Innate immunity | O95786 | 2AY2, 4A2P, 4A2Q, 4A2W, 4A36, 3TBK, 2YKG, 3TMI |
| DRA | RNA | MDA5 | hs | Innate immunity | Q9BYX4 | None |
| DRA | RNA | LGP2 | hs | Innate immunity | Q96C10 | None |
| DRA | RNA | Dicer | hs | Gene silencing | Q9UPY3 | None |
| DRA | RNA | DRH-1 | ce | Gene silencing | G5EDI8 | None |
| DRA | RNA | DRH-3 | ce | Gene silencing | Q93413 | None |
| FANCM | DNA | FANCM | hs | DNA repair | Q8IYD8 | None |
| FANCM | DNA | MPH1 | sc | Genome stability | P40562 | None |
| FANCM | DNA | Hef | pf | DNA repair | Q8TZH8 | 1WP9 |

Notes: Members of DRA family are boxed and bolded. a Abbreviations:
ce, Caenorhabditis elegans; dm, Drosophila melanogaster; ec,
Escherichia coli; hs, Homo sapiens; sc, Saccharomyces cerevisiae;
pf, Pyrococcus furiosus. b Sequence ID refers to the protein
sequence taken from the UniProt Knowledgebase (www.uniprot.org).
c PDB Code refers to the structures available from Protein Data
Bank (www.rcsb.org).

In addition to the conserved ATPase/helicase core and the
specialized RNA recognition and transduction domains, DRAs contain
two semi-conserved motifs that contribute to binding of
double-stranded nucleic acid (Fig. 2). In these proteins, motifs IIa
and Vc form contacts with the 5'-3' "second strand," in contrast to
the other nucleic acid binding motifs, which contact only the 3'-5'
"tracking strand" that is bound by all translocative helicases.
Motif IIa in particular was initially noted in the Sulfolobus
solfataricus Rad54 structure and later within the motor subunit of
the T1R restriction-modification enzyme EcoR124 from Escherichia
coli. 25,29 Not surprisingly, motif IIa also appears to be present
in members of the DEAD-box family, and its functional role in
duplex RNA binding is supported by recent structural studies of the
DEAD-box protein Mss116 bound to duplex RNA. 30 Motifs IIa and Vc
are primarily conserved structurally rather than in primary
sequence (Fig. 2). 17,23,29 Although there are semi-conserved
lysines and asparagines in motif IIa and a semi-conserved
asparagine in motif Vc, the majority of the contacts made with
nucleic acids involve the peptide backbone.

By contrast, RNA-dependent ATPases that function as helicase
enzymes preferentially bind single-stranded nucleic acid before
unwinding adjacent duplex regions. Two distinct mechanisms of
unwinding have been proposed: melting of the RNA backbone through
local distortions of the A-form duplex RNA, as hypothesized for the
DEAD-box family of helicases including Ded1p and Mss116p, 31 and
displacement of adjacent duplex strands during translocation, as
shown for the viral SF2 DExH helicases including NS3 32 and
NPH-II. 32,33 In contrast to these bona fide helicases, DRAs
preferentially bind RNA duplex instead of single-stranded
regions, 16,34,35 and in all existing structures of RNA-RIG-I
complexes, the duplex RNA maintains an undistorted A-form
conformation. 17-19,36 Furthermore, the crucial β-hairpin motif that
participates in strand separation by DExH helicases is missing in
RIG-I and other DRAs. 17,37,38 Therefore, because DRAs have no
structural features designed to disrupt duplex RNA, and no
structural motifs for coupling translocation with strand
separation, it is not surprising that DRAs have not yet been shown
to function as unwindases.

The Molecular and Structural Biology of DRAs 

Structural studies on nucleic acid-dependent ATPases are hampered
by the intrinsic flexibility that arises from their function as
molecular motors. In these proteins, the two conserved RecA-like
domains are loosely connected in the absence of nucleic acid and
ATP. This is particularly true for DEAD-box proteins, which tend to
be captured crystallographically only in the presence of both ssRNA
and ATP analogs, 39-43 thereby limiting our understanding of their
functional cycles. Structural studies of DRAs face the same
challenges as those focused on DEAD-box proteins. Adding to this
difficulty, DRAs are large multi-domain proteins with several
moving parts that usually function within even larger protein
complexes. Nevertheless, recent cryo-electron microscopy studies of
Dicer 44-46 and crystallographic studies on RIG-I have advanced our
understanding of the biological function and mechanical properties
of DRA proteins. 15,17-19

Figure 2. Sequence and structural features of RIG-I that contribute
to duplex RNA recognition. (A) Sequence alignment of DRAs and other
SF2 proteins. Note that motifs IIa and Vc (boxed in dotted lines)
are not highly conserved in amino acid sequence. (B) The HEL2i
domain juxtaposes with the duplex RNA backbone (PDB codes: 3TMI in
green; 2YKG in yellow; 4A36 in magenta). (C) Specialized motifs IIa
and Vc recognize the top strand of the duplex RNA (the bottom
strand, or tracking strand, is the nucleic acid strand that binds
to the SF2α proteins; the top strand is the complementary strand).
(D) Possible structural conservation of motif IIa and motif Vc in
DEAD-box family members. The figure shows the aligned structures of
DEAD-box protein:ssRNA complexes with duplex RNA (PDB codes: 2J0S,
3I5X, 3G0H and 2DB3). The possible positions of motifs IIa and Vc
in DEAD-box proteins are labeled in parentheses.

Dicer: the small RNA processing machine. Ever since Fire et al.
published the groundbreaking paper on RNA interference (RNAi), 47
great strides have been made in understanding the biogenesis and
functional mechanisms of the small RNAs that facilitate
dsRNA-mediated gene regulation. 1,48-53 There are two major types
of small RNAs: microRNA (miRNA) and small interfering RNA (siRNA).
While these two RNAs differ in their pathways of biogenesis, they
share similarities in function. Of central importance to the RNAi
pathway is the formation of an RNA-induced silencing complex
(RISC), 54,55 which binds to the target mRNA and downregulates gene
expression by either RNA degradation or translational arrest. 1,52
RISC is assembled from the RISC-loading complex (RLC), which
includes dsRNA, Ago, Dicer and additional dsRNA binding and
accessory proteins. 56,57

Dicer plays a major role in gene regulation by processing dsRNA
precursors into short fragments that are used to target the
silencing of specific genes. 58-61 Dicer cleaves long duplex
precursor RNAs (pre-miRNA or pre-siRNA) into short miRNA and siRNA
fragments and then loads the correct "guide" strand into RISC.
Dicer received its name because of its dsRNA cleavage, or "dicing,"
activity. 58 Phylogenetically, Dicer is a class III RNase, members
of which are conserved among eukaryotic species. 62 In humans, there
is only one Dicer protein, hsDicer1; however, Drosophila and plants
contain two and four Dicer proteins, respectively. Human Dicer1
mutations have been found in various cancer syndromes, 63-65
emphasizing its fundamental roles in gene regulation. In general,
all Dicer and Dicer-like proteins (DCL) from eukaryotic species
share a similar domain architecture, containing an SF2
RNA-dependent ATPase domain at the N terminus, a DUF283 domain
(Domain of Unknown Function), a PAZ domain, two tandem RNase III
domains and a dsRNA-binding domain (dsRBD) at the C terminus 14,44
(Fig. 1B). The first structural insights into Dicer components came
from a crystal structure of a Dicer homolog obtained from the
unicellular eukaryote Giardia intestinalis. This structure revealed
a specific spatial arrangement of the PAZ domain relative to the
two RNase III domains, 66 suggesting that Dicer contains a molecular
ruler that enables it to generate dsRNA fragments of specific
length. Unfortunately, unlike Dicer genes from other organisms,
Giardia intestinalis Dicer does not contain an ATPase motor domain.

The ATPase motor domain of Dicer is highly conserved across
species, and it is phylogenetically distinguishable as a DRA
protein (Fig. 1C). The precise biochemical function of the motor
domain is still unclear, and it is not yet known whether the
RNA-dependent ATPase activity is actually linked to duplex
unwinding or whether the motor domain behaves like a helicase.
Recent studies have indicated that it plays a role in helping to
select the "guide strand" from the two duplex strands that are
initially bound within the RISC-loading complex. This is
accomplished by sensing thermodynamic features of the RNA duplex
and determining which terminus is more easily opened. 67 The
selected siRNA guide strand is then loaded into the Ago protein,
resulting in formation of a functional RISC complex. 67

The overall three-dimensional architecture and domain organization
of Dicer are well conserved among orthologs. 14,44 Dicer adopts an
L-shape as determined by negative-stain electron microscopy
(EM). 44 Using a streptavidin tagging method and domain deletion
constructs, Lau et al. located the motor domain at the base of the
L-shaped structure (Fig. 3). Furthermore, when the motor domain of
RIG-I was docked into the EM structure of Dicer, the RNA binding
interfaces of the motor domain and the RNase III domain created an
adjacent central RNA binding groove. 44 A complex between Dicer and
TRBP (TAR RNA Binding Protein, an accessory protein of Dicer and a
dsRBD protein) forms a similar L shape with a long edge of 150 Å
and a 100 Å extension at the bottom end. 45 Because of the small
size and intrinsic flexibility of TRBP, it is difficult to
accurately assign its location, particularly in the absence of an
siRNA or miRNA substrate. A low-resolution EM structure of the
human RISC-loading complex (containing Dicer, AGO2 and TRBP in a
1:1:1 stoichiometric ratio) was obtained by cross-linking the
complex. In the resulting model, AGO2 was proposed to interact with
the C-terminal region of Dicer. 68 The RNA binding site of AGO2 is
located in close proximity to the C-terminal region of Dicer,
pointing away from the N-terminal motor domain. This model is
consistent with biochemical data suggesting that the motor domain
of Dicer may not be required for loading mature siRNA into
AGO2. 69 Two discrete conformations of the Dicer motor domain have
been identified, suggesting that it may adopt multiple
conformations on dsRNA. 44 This structural flexibility contributes
to specific dsRNA recognition and may support a processive dicing
mechanism (Fig. 3). 35,70,71 Not surprisingly, it is similar to
structural rearrangements observed when the RIG-I motor domain
binds to RNA duplex. 18,19

Figure 3. (A) Segmented map of human Dicer with crystal structures
of homologous domains docked. (B) Model for pre-miRNA recognition.
A pre-miRNA hairpin is modeled into the proposed binding channel
of Dicer, with the stem-loop fit in the RNA-binding cleft of the
protein. (C) Schematic for processive dicing in which dsRNA is
translocated into the nuclease core (1). The PAZ domain (purple)
recognizes the dsRNA end, positioning RNase III (orange) for
cleavage (2). The siRNA product is released while the dsRNA
substrate remains bound to the protein (3). Reprinted with
permission from Lau et al. 44

RIG-I: The innate immune sensor for viral RNA detection and
defense. A diverse group of cytoplasmic surveillance proteins
sensitively detect the presence of viral genomes and gene products
and then initiate inflammatory responses that enable vertebrates to
fight viral infections. 72,73 These proteins form the foundation of
our innate immune response. The RIG-I-like receptors (RLRs) are a
specialized subclass of DRA proteins that detect double-stranded
viral RNAs in the cytoplasm and initiate a series of signaling
events to elicit an antiviral response. The RLR motor proteins
include RIG-I, 74 MDA5 (Melanoma Differentiation Associated
gene 5) 34 and LGP2 (Laboratory of Genetics and Physiology 2). 75
They were initially identified in different biological contexts and
were later rediscovered as key members of antiviral innate
immunity. 72,73,76 RIG-I is the most extensively studied member of
the RLRs and has been demonstrated to be the major antiviral RLR.
RIG-I recognizes a broad range of viruses, including
negative-stranded viruses (e.g., vesicular stomatitis virus, Sendai
virus, influenza virus and rabies virus); positive-stranded viruses
(e.g., dengue virus, Japanese encephalitis virus, West Nile virus
and hepatitis C virus); a dsRNA virus (reovirus); and a DNA virus
(Epstein-Barr virus). 3,77 MDA5 is both structurally and
functionally similar to RIG-I and complements RIG-I by recognizing
a distinct set of viral RNAs, although there might be some
overlap. 3 LGP2 is thought to serve as a feedback regulator, but its
exact function is still not clearly defined. 78,79

RIG-I contains two tandem caspase activation and recruitment
domains (CARDs; CARD1 and CARD2) at its N terminus, which mediate a
downstream signaling relay; a central DRA motor domain; and a
C-terminal domain (CTD) that facilitates viral RNA recognition
(Fig. 1B). 15,17-19,80 It is commonly believed that RIG-I is
inactive in resting cells and is activated upon detection and
binding of viral RNA. The activated RIG-I is believed to hydrolyze
ATP and initiate a signaling cascade and type I interferon (IFN)
response via the adaptor protein MAVS, also known as IPS-1, VISA or
CARDIF. 3 MAVS in turn activates several transcription factors,
including IRF3, IRF7 and NF-κB, leading to the production of IFN
and inflammatory cytokines. 77,80,81 Moreover, RIG-I displays
apoptosis-inducing properties in tumor cells. 82,83 Effective
therapeutic RIG-I antagonists and agonists may provide new tools
for the treatment of viral infections and cancer. 84

Recent research has focused on characterizing the molecular
determinants for RNA-RIG-I recognition, the mechanisms of
activation and signaling, and the regulatory pathways that help
control RIG-I signaling. Structural and biochemical studies on
RIG-I have revealed that 5' tri-phosphorylated, blunt-ended duplex
RNAs are the optimal substrate for RIG-I binding and
activation. 17-19,36,73,85,86,120 The exact length of the duplex is
unclear, although it is generally accepted that RIG-I recognizes
RNA that is ten to hundreds of base pairs in length, while MDA5
forms filaments on longer RNA in the thousands of base
pairs. 85,87-90 The CTD is primarily responsible for 5'
tri-phosphate recognition, and both the CTD and helicase domain
form critical contacts with the RNA duplex. 15,18,19,85,91-93 The
latest structural studies indicate that RNA binding induces a
dramatic conformational change in RIG-I (Fig. 4A and B). The role
of ATP binding and hydrolysis has not been determined, although
mutations in the ATPase domain are clearly deleterious to
function. 15,18,19 Post-translational modifications of RIG-I,
including ubiquitination, phosphorylation and SUMOylation, have
been reported to be important for its function. 94-96 Non-covalent
polyubiquitin binding to the CARDs is likely to be essential for
full activation of RIG-I and possibly oligomerization. 97,98

In the resting state, RIG-I adopts an autoinhibited conformation in
which the motor domain is sterically blocked. 15,18 The CARDs are
trapped in a fixed conformation relative to the HEL domains
(synonymous with RecA folds 1 and 2) through an interaction between
the second CARD and the insertion domain (HEL2i) (Fig. 4A). 18 This
conformation was speculated to inhibit both CARD1 ubiquitination by
the ubiquitin E3 ligase TRIM25 and non-covalent binding of
polyubiquitin to the CARDs, both of which are required for RIG-I
activation. In the apoenzyme state, the RNA binding surfaces of
RIG-I (and particularly the CTD) are largely exposed, allowing
RIG-I to search for viral RNAs. The CTD, which is connected to the
HEL domain through a long and flexible pincer domain, enhances the
specificity of RIG-I for tri-phosphorylated RNA. 18

It is believed that viral activation of RIG-I signaling occurs in a
carefully choreographed sequence of events. Binding of viral RNA is
the initial trigger for RIG-I activation, whereupon the motor
domain of RIG-I (comprising HEL1, HEL2 and HEL2i) forms a
ring-shaped clamp around the sugar-phosphate backbone of the duplex
and the CTD caps the helical terminus, even in the absence of a 5'
triphosphate. The tight and specific interaction of the CTD with
the duplex terminus may prevent RIG-I from binding with high
affinity to internal sites on the duplex 17-19,36,120 (Fig. 4B).
Structural analysis suggests that binding of RIG-I to RNA alone may
not be sufficient to disrupt the autoinhibitory interaction between
the CARD2 and HEL2i domains, 19 hinting that an additional trigger
might be needed to activate signaling. In the crystal structures of
RIG-I:dsRNA with AlFx and BeFx, ATP binding appears to bring the
RIG-I helicase into a more closed and compact conformation relative
to RIG-I structures that contain only dsRNA (Fig. 4C). 17,18,36 This
ATP-induced conformational change shifts the CTD and HEL2i toward
each other, resulting in a clash between the CARDs and CTD
(Fig. 4C). Consequently, the structure is likely to reorganize,
reorienting the relative positions of the CARDs and HEL2i and
potentially releasing the CARDs, which makes them available for
interaction with MAVS and activates the innate immune response
(Fig. 4D). 97,98

In agreement with this idea, ATP is required for in vitro
reconstitution of the RIG-I signaling pathway, 97,98 although ATP
hydrolysis and turnover are not essential. 100 Activation of RIG-I
is therefore a tightly regulated, multi-checkpoint process,
starting with recognition of the correct RNA substrate, followed by
ATP binding, and then coupled structural rearrangements that
release autoinhibition and switch RIG-I into a signaling-competent
state (Fig. 4). 15,101
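
The multi-checkpoint picture described above can be made concrete with
a deliberately simplified state model. The Python sketch below only
illustrates the proposed ordering of events (duplex RNA recognition,
then ATP binding, then release of autoinhibition). The state names, the
`advance` and `activate` functions and the boolean inputs are
hypothetical, and the model ignores kinetics, polyubiquitin binding and
oligomerization.

```python
from enum import Enum


class RigIState(Enum):
    AUTOINHIBITED = "autoinhibited, CARDs held by HEL2i"
    RNA_BOUND = "duplex RNA bound, CARDs still sequestered"
    ATP_BOUND = "RNA and ATP bound, compact helicase"
    SIGNALING_COMPETENT = "CARDs released"


def advance(state: RigIState, rna_is_duplex: bool, atp_present: bool) -> RigIState:
    """Move one checkpoint forward if its condition is met (toy model)."""
    if state is RigIState.AUTOINHIBITED and rna_is_duplex:
        return RigIState.RNA_BOUND
    if state is RigIState.RNA_BOUND and atp_present:
        return RigIState.ATP_BOUND
    if state is RigIState.ATP_BOUND:
        # Assumption: the CARD/CTD clash always releases the CARDs.
        return RigIState.SIGNALING_COMPETENT
    return state  # checkpoint not passed; stay where we are


def activate(rna_is_duplex: bool, atp_present: bool) -> RigIState:
    """Run the three checkpoints in order from the resting state."""
    state = RigIState.AUTOINHIBITED
    for _ in range(3):  # at most three transitions are possible
        state = advance(state, rna_is_duplex, atp_present)
    return state


print(activate(rna_is_duplex=True, atp_present=True).name)
print(activate(rna_is_duplex=True, atp_present=False).name)
```

In this toy model the absence of ATP stalls the protein in the
RNA-bound state, which mirrors the suggestion that RNA binding alone
may not be sufficient to release the CARDs. Hydrolysis is intentionally
not modeled, since the text indicates that turnover is not essential.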

DRH-3: Attenuating the siRNA pathway in Caenorhabditis elegans. A
group of endogenous siRNAs from Caenorhabditis elegans, named 22G
endo-siRNAs, are linked to a variety of biological processes that
are vital to maintaining genetic stability, including transposon
silencing and chromosome segregation in germline cells. 102-104
Defects in the endo-RNAi pathway can result in many forms of
genetic instability, such as loss of chromosomes during mitosis,
abnormal gene expression and increased sensitivity to X-ray
irradiation. 102-105 These siRNAs are classified as secondary siRNA
molecules because they are produced directly by RNA-dependent RNA
polymerase (RdRP) transcription, without a double-stranded RNA
intermediate or cleavage. 106,107

Dicer-related helicase 3 (DRH-3) is a large multi-domain,
multi-functional protein that is essential for the biogenesis of
these endogenous secondary siRNAs. 12,13,103,104 DRH-3 interacts
with members of the C. elegans RNAi machinery, including Dicer
(DCR-1) and the RdRP RRF-1. 12,13 A large protein (1119 amino acids,
ca. 130 kDa), DRH-3 contains three sub-domains (Fig. 1B). These
include an N-terminal domain of novel sequence that lacks a known
homolog and the central motor domain that is common to DRA
proteins. 5,10,108 The C-terminal domain of DRH-3 shares sequence
similarity with the CTD that plays an important role in RNA and
triphosphate recognition by RIG-I. Although there are no structural
studies of DRH-3, a preliminary biochemical characterization has
been reported. Several key features of DRH-3 are now apparent: the
protein binds more strongly to dsRNA than to ssRNA; potent ATP
hydrolysis by DRH-3 is stimulated only by dsRNA; and DRH-3 does not
have unwinding activity. 16 DRH-1, a homolog of DRH-3, is implicated
in both germline and somatic RNA interference (RNAi) pathways as
well as virus sensing and viral siRNA formation 109,110 and may be
an equally important target for future biochemical and structural
studies.

DRH-3 has a domain organization that is very similar to that of
RIG-I. It is tempting to speculate that DRH-3 might bind to the RNA
duplexes generated by the endogenous siRNA pathway and recruit
signaling partners through its NTD. This speculation is further
supported by the absence of helicase activity and the preference
for canonical RNA duplex binding. 16

Concluding Remarks and Future Directions for Research on DRA Proteins 

DRAs share several characteristic features that distinguish them 
from other groups of RNA-dependent ATPases. First, in addi- 
tion to the conserved motifs that classify DRAs as SF2 RNA- 
dependent ATPases, DRAs contain unique motifs (e.g., motifs 
IIa and Vc) and domains (HEL2i) that specialize in duplex 
RNA recognition. Second, although the literature on DRAs is 
somewhat unclear on this point, DRAs do not appear to possess 
RNA unwinding activity and they may accomplish their biologi- 
cal function by simply binding duplex RNA or by translocating 
along the duplex without unwinding. Lastly, the DRAs discussed 
in this review are all part of larger protein complexes that func- 
tion in duplex RNA sensing and processing. 

One of the most intriguing questions about DRAs is whether 
they, in fact, require ATP hydrolysis for function. At the present 
time, it is not established that DRAs require ATP binding and/ 
or hydrolysis and, like DEAD-box proteins, they may only utilize 
ATP for recycling. ATPase activity is unnecessary for pre-miRNA 
processing by human Dicer, 35,69,111 but in contrast, Drosophila 
Dicer-2 appears to require ATP for siRNA production. 55,112 The 
ATPase motor domain from C. elegans DCR is required for the 
biogenesis of some but not all siRNAs. 113 Evidently, there is no 
consensus for the function of ATP hydrolysis by Dicers from dif- 
ferent species. RIG-I has shown a clear dependence on ATP for 
in vitro reconstitution, 97,98 but mutagenesis studies by Bamming 
and Horvath suggest that signaling by RIG-I and MDA5 can 
occur independently of ATPase enzymatic activity. 100 These findings 
may be reconciled by recent structural data, which suggest that ATP 
binding, but not necessarily hydrolysis, induces a conformational change 
in the RIG-I helicase domain that may eventually lead to RIG-I 
activation (Fig. 4). Further experiments are needed to verify this 
structure-driven hypothesis. 




Figure 4. Structural basis for dsRNA recognition and activation of RLRs. 
Models were created by aligning and merging known duck and human 
RIG-I structures and considering our recent solution hydrodynamic studies 
on RIG-I conformational dynamics upon RNA and ATP binding. 36 (A) 
Model of full length RIG-I apoenzyme based on structures of duck RIG-I 
(PDB: 4A2W) and the CTD (PDB: 4A2V). In the autoinhibited conformation, 
the N-terminal CARDs are sequestered from signaling and maintain RIG-I 
in an autoinhibited state. (B) RIG-I switches into a semi-closed conformation 
upon RNA binding. Binding of dsRNA to the CTD brings the HEL 
domains in contact with dsRNA (PDB: 4A2W and 2YKG). (C) ATP binding 
(yellow) closes the HEL domains and causes a clash between the CARDs 
and CTD (PDB: 4A2W, 4A36 and 2YKG). (D) The change in conformation 
upon dsRNA and ATP binding releases the CARD domains for signaling. 

Although the recent crystal structures of RIG-I advance our 
understanding of DRAs, there are several questions that remain 
unanswered. One question is whether DRAs recognize specific 
RNA sequences or structures. Several studies suggest that Dicer 
recognizes specific terminal structures of RNA as evidenced by 
the fact that human Dicer is more efficient in processing pre-miRNA 
than siRNA. 71 Furthermore, Drosophila Dicer-1 recognizes 
the terminal loop structure of pre-miRNAs through its 
motor domain, 70 and both Drosophila Dicer-2 and C. elegans 
DCR-1 differentiate the end structures of long duplex RNAs for 
endo-siRNA processing. 114 This suggests there might be a special 
structural feature in the Dicer RNA-dependent ATPase domain 
that is responsible for recognizing RNA ends. RIG-I specifically 
recognizes tri-phosphorylated RNA through its accessory CTD 
domain, and given that RIG-I recognizes a broad but distinct set 
of RNA viruses, it will be interesting to determine if RIG-I can 
recognize unique viral RNA sequences or structures, in addition 
to 5' triphosphate and duplex RNA. 

Oligomerization is a variable characteristic that can be impor- 
tant for the function of SF2 proteins, including DRAs. To gener- 
alize, the minimum functional unit of SF2 proteins is monomeric, 
but there is biochemical evidence suggesting that oligomerization 
can enhance biological activity of certain SF2 proteins. 115,116 As 
for DRAs, there is no indication that Dicer forms oligomers; 
however, RIG-I has been proposed to dimerize 93 or tetramerize 97 
upon activation. The exact molecular basis for RIG-I oligomerization 
is still unknown; however, the downstream target of RIG-I, 
MAVS, forms filaments upon activation. Distinct from RIG-I, 
MDA5 cooperatively binds to long duplex RNAs and may form 
a filament-like structure itself, 115,117 which would be unique not 
only among DRAs but also SF2 proteins in general. It is therefore 
important to establish whether oligomerization is obligatory for 
RIG-I or MDA5 function. 

Given the central role of DRA proteins in diverse cellular 
pathways that range from epigenetic regulation to the innate 
immune response, it will be interesting to characterize the regu- 
latory cofactors that help to specify and control the activity of 
DRAs. It will also be exciting to identify new DRA proteins that 
have distinct molecular functions. The DRA proteins seem to be 
markers of interesting biology, and investigation of this protein 
family will continue to yield major insights into the nanome- 
chanical features of living systems. 

An informative review on the molecular mechanism of RIG-I 
activation was published while this manuscript was under review, 
and it is consistent with our model (see ref. 121). 